Overview
Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 16209 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.8 MiB |
| Average record size in memory | 184.0 B |
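The memory figures above can be cross-checked against each other. The sketch below uses only numbers from the table; the idea that each of the 23 variables occupies 8 bytes per row (a 64-bit value) is an assumption on our part, although it happens to reproduce the reported 184.0 B exactly.

```python
# Figures copied from the Dataset statistics table.
n_rows = 16209
n_vars = 23
avg_record_bytes = 184.0

# Assumption (not stated in the report): every column stores one 8-byte value per row.
bytes_per_value = 8
assert n_vars * bytes_per_value == avg_record_bytes

# Average record size times row count should give the total size in memory.
total_mib = n_rows * avg_record_bytes / 2**20
print(f"{total_mib:.1f} MiB")  # 2.8 MiB
```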
Variable types
| Numeric | 18 |
|---|---|
| DateTime | 1 |
| Categorical | 4 |
Alerts
| Alert | Type |
|---|---|
| bathrooms is highly overall correlated with bedrooms and 6 other fields | High correlation |
| bedrooms is highly overall correlated with bathrooms and 2 other fields | High correlation |
| floors is highly overall correlated with bathrooms and 3 other fields | High correlation |
| grade is highly overall correlated with bathrooms and 6 other fields | High correlation |
| lat is highly overall correlated with zip_median_price and 1 other field | High correlation |
| long is highly overall correlated with zipcode | High correlation |
| price is highly overall correlated with grade and 4 other fields | High correlation |
| sqft_above is highly overall correlated with bathrooms and 6 other fields | High correlation |
| sqft_living is highly overall correlated with bathrooms and 5 other fields | High correlation |
| sqft_living15 is highly overall correlated with bathrooms and 4 other fields | High correlation |
| sqft_lot is highly overall correlated with sqft_lot15 | High correlation |
| sqft_lot15 is highly overall correlated with sqft_lot | High correlation |
| view is highly overall correlated with waterfront | High correlation |
| waterfront is highly overall correlated with view | High correlation |
| yr_built is highly overall correlated with bathrooms and 2 other fields | High correlation |
| zip_median_price is highly overall correlated with lat and 2 other fields | High correlation |
| zip_tier is highly overall correlated with lat and 1 other field | High correlation |
| zipcode is highly overall correlated with long | High correlation |
| waterfront is highly imbalanced (94.0%) | Imbalance |
| view is highly imbalanced (72.1%) | Imbalance |
| sqft_basement has 9882 (61.0%) zeros | Zeros |
| yr_renovated has 15537 (95.9%) zeros | Zeros |
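The imbalance percentages reported for waterfront and view can be reproduced from the class counts listed later in this report. The sketch below assumes the score is one minus the normalized Shannon entropy of the class frequencies; that definition is our assumption about how the profiler computes it, but it matches both reported values. The function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (bits) of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Class counts copied from the waterfront and view sections of the report.
waterfront = [16096, 113]
view = [14604, 743, 375, 254, 233]

print(f"{imbalance_score(waterfront):.1%}")  # 94.0%
print(f"{imbalance_score(view):.1%}")        # 72.1%
```

A score near 100% means almost all rows fall into one class; a perfectly balanced variable scores 0%.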
Reproduction
| Analysis started | 2025-12-29 09:44:33.772293 |
|---|---|
| Analysis finished | 2025-12-29 09:45:20.974681 |
| Duration | 47.2 seconds |
| Software version | ydata-profiling v4.18.0 |
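The reported duration follows directly from the two timestamps. A minimal check using only the values in the table:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2025-12-29 09:44:33.772293")
finished = datetime.fromisoformat("2025-12-29 09:45:20.974681")

duration = (finished - started).total_seconds()
print(f"{duration:.1f} seconds")  # 47.2 seconds
```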
Variables
id
Real number (ℝ)
| Distinct | 16110 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.5757708 × 10⁹ |
| Minimum | 1000102 |
| Maximum | 9.9000002 × 10⁹ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 1000102 |
|---|---|
| 5-th percentile | 4.7500126 × 10⁸ |
| Q1 | 2.1230492 × 10⁹ |
| median | 3.9049502 × 10⁹ |
| Q3 | 7.304301 × 10⁹ |
| 95-th percentile | 9.2943006 × 10⁹ |
| Maximum | 9.9000002 × 10⁹ |
| Range | 9.8990001 × 10⁹ |
| Interquartile range (IQR) | 5.1812518 × 10⁹ |
Descriptive statistics
| Standard deviation | 2.8746614 × 10⁹ |
|---|---|
| Coefficient of variation (CV) | 0.62823544 |
| Kurtosis | -1.2565548 |
| Mean | 4.5757708 × 10⁹ |
| Median Absolute Deviation (MAD) | 2.3984503 × 10⁹ |
| Skewness | 0.24214742 |
| Sum | 7.4168669 × 10¹³ |
| Variance | 8.2636782 × 10¹⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1545800290 | 2 | < 0.1% |
| 4031000520 | 2 | < 0.1% |
| 3523069060 | 2 | < 0.1% |
| 1922059278 | 2 | < 0.1% |
| 2621600015 | 2 | < 0.1% |
| 4305200070 | 2 | < 0.1% |
| 8161020060 | 2 | < 0.1% |
| 1423049019 | 2 | < 0.1% |
| 5101405604 | 2 | < 0.1% |
| 2724049222 | 2 | < 0.1% |
| Other values (16100) | 16189 |
| Value | Count | Frequency (%) |
| 1000102 | 1 | |
| 1200019 | 1 | |
| 1200021 | 1 | |
| 2800031 | 1 | |
| 3600057 | 1 | |
| 5200087 | 1 | |
| 7200080 | 1 | |
| 7200179 | 1 | |
| 7400062 | 1 | |
| 7600057 | 1 |
| Value | Count | Frequency (%) |
| 9900000190 | 1 | |
| 9895000040 | 1 | |
| 9842300540 | 1 | |
| 9842300485 | 1 | |
| 9839301165 | 1 | |
| 9839301055 | 1 | |
| 9839300875 | 1 | |
| 9839300775 | 1 | |
| 9839300545 | 1 | |
| 9839300285 | 1 |
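Although the dataset has no fully duplicated rows, id is not unique: there are fewer distinct ids than observations. The sketch below works out how many rows repeat an id already seen and checks that the frequency table is internally consistent. It uses only figures from this section; whether every repeated id occurs exactly twice cannot be confirmed from the report alone.

```python
n_rows = 16209
n_distinct_ids = 16110

# Rows whose id already appears on an earlier row.
repeat_rows = n_rows - n_distinct_ids
print(repeat_rows)  # 99

# The most-frequent-values table lists 10 ids with count 2, plus "Other values (16100)" covering 16189 rows.
listed_ids, listed_rows = 10, 10 * 2
other_ids, other_rows = 16100, 16189
assert listed_ids + other_ids == n_distinct_ids
assert listed_rows + other_rows == n_rows
```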
date
Date
| Distinct | 366 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 126.8 KiB |
| Minimum | 2014-05-02 00:00:00 |
| Maximum | 2015-05-24 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
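Comparing the date range with the number of distinct dates shows how many calendar days in the window have no record at all. A small sketch using the minimum, maximum, and distinct count above:

```python
from datetime import date

first = date(2014, 5, 2)
last = date(2015, 5, 24)
distinct_dates = 366

# Inclusive number of calendar days between the first and last date.
calendar_days = (last - first).days + 1
print(calendar_days)                   # 388
print(calendar_days - distinct_dates)  # 22 days with no observation
```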
price
Real number (ℝ)
High correlation
| Distinct | 3428 |
|---|---|
| Distinct (%) | 21.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 537470.28 |
| Minimum | 75000 |
| Maximum | 7700000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 75000 |
|---|---|
| 5-th percentile | 210000 |
| Q1 | 320000 |
| median | 450000 |
| Q3 | 640000 |
| 95-th percentile | 1150000 |
| Maximum | 7700000 |
| Range | 7625000 |
| Interquartile range (IQR) | 320000 |
Descriptive statistics
| Standard deviation | 360303.58 |
|---|---|
| Coefficient of variation (CV) | 0.6703693 |
| Kurtosis | 37.106004 |
| Mean | 537470.28 |
| Median Absolute Deviation (MAD) | 150000 |
| Skewness | 4.0330623 |
| Sum | 8.7118558 × 10⁹ |
| Variance | 1.2981867 × 10¹¹ |
| Monotonicity | Not monotonic |
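Several of the price statistics are derived from others in the same tables. The sketch below recomputes them from the reported mean, standard deviation, quartiles, and extremes, as a way to read the tables rather than as the profiler's own code.

```python
# Values copied from the price tables above.
mean, std = 537470.28, 360303.58
q1, q3 = 320000, 640000
minimum, maximum = 75000, 7700000

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
variance = std ** 2              # variance is the squared standard deviation

print(f"{cv:.7f}")        # 0.6703693
print(iqr, value_range)   # 320000 7625000
print(f"{variance:.7e}")  # 1.2981867e+11
```

The mean sits well above the median (450000), which is consistent with the strong positive skewness reported for price.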
| Value | Count | Frequency (%) |
| 450000 | 135 | 0.8% |
| 350000 | 134 | 0.8% |
| 425000 | 121 | 0.7% |
| 550000 | 113 | 0.7% |
| 325000 | 111 | 0.7% |
| 375000 | 109 | 0.7% |
| 500000 | 107 | 0.7% |
| 400000 | 106 | 0.7% |
| 525000 | 103 | 0.6% |
| 250000 | 100 | 0.6% |
| Other values (3418) | 15070 |
| Value | Count | Frequency (%) |
| 75000 | 1 | < 0.1% |
| 80000 | 1 | < 0.1% |
| 81000 | 1 | < 0.1% |
| 82000 | 1 | < 0.1% |
| 84000 | 1 | < 0.1% |
| 85000 | 2 | |
| 86500 | 1 | < 0.1% |
| 90000 | 4 | |
| 92000 | 1 | < 0.1% |
| 95000 | 4 |
| Value | Count | Frequency (%) |
| 7700000 | 1 | |
| 7062500 | 1 | |
| 6885000 | 1 | |
| 5110800 | 1 | |
| 4668000 | 1 | |
| 4489000 | 1 | |
| 4208000 | 1 | |
| 3800000 | 2 | |
| 3710000 | 1 | |
| 3635000 | 1 |
bedrooms
Real number (ℝ)
High correlation
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3678203 |
| Minimum | 0 |
| Maximum | 33 |
| Zeros | 8 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 33 |
| Range | 33 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.93327008 |
|---|---|
| Coefficient of variation (CV) | 0.27711397 |
| Kurtosis | 63.747881 |
| Mean | 3.3678203 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.4194036 |
| Sum | 54589 |
| Variance | 0.87099303 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 7380 | 45.5% |
| 4 | 5128 | 31.6% |
| 2 | 2098 | 12.9% |
| 5 | 1213 | 7.5% |
| 6 | 197 | 1.2% |
| 1 | 142 | 0.9% |
| 7 | 26 | 0.2% |
| 8 | 9 | 0.1% |
| 0 | 8 | < 0.1% |
| 9 | 5 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 8 | < 0.1% |
| 1 | 142 | 0.9% |
| 2 | 2098 | 12.9% |
| 3 | 7380 | |
| 4 | 5128 | |
| 5 | 1213 | 7.5% |
| 6 | 197 | 1.2% |
| 7 | 26 | 0.2% |
| 8 | 9 | 0.1% |
| 9 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 33 | 1 | < 0.1% |
| 10 | 2 | < 0.1% |
| 9 | 5 | < 0.1% |
| 8 | 9 | 0.1% |
| 7 | 26 | 0.2% |
| 6 | 197 | 1.2% |
| 5 | 1213 | 7.5% |
| 4 | 5128 | |
| 3 | 7380 | |
| 2 | 2098 | 12.9% |
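The maximum of 33 bedrooms stands far from the rest of the distribution. One common way to quantify this is Tukey's 1.5 × IQR rule; the report itself does not apply it, so the sketch below is an illustration under that assumption, using the quartiles and counts listed above.

```python
# Quartiles copied from the bedrooms quantile table.
q1, q3 = 3, 4
iqr = q3 - q1
lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr

# Value -> count, from the frequency tables above. The two "Other values"
# (10 and 33) are read from the table sorted by descending value.
counts = {0: 8, 1: 142, 2: 2098, 3: 7380, 4: 5128, 5: 1213,
          6: 197, 7: 26, 8: 9, 9: 5, 10: 2, 33: 1}
assert sum(counts.values()) == 16209

# Rows falling outside the fences.
flagged = sum(n for value, n in counts.items() if value < lower or value > upper)
print(lower, upper, flagged)  # 1.5 5.5 390
```

With an IQR of only 1, the fences are tight and flag ordinary 1- and 6-bedroom homes as well, so the rule is better used here to rank candidates for review than to drop rows automatically.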
bathrooms
Real number (ℝ)
High correlation
| Distinct | 29 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1130545 |
| Minimum | 0 |
| Maximum | 8 |
| Zeros | 7 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1.5 |
| median | 2.25 |
| Q3 | 2.5 |
| 95-th percentile | 3.5 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.76524195 |
|---|---|
| Coefficient of variation (CV) | 0.36214966 |
| Kurtosis | 1.0038592 |
| Mean | 2.1130545 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 0.46152496 |
| Sum | 34250.5 |
| Variance | 0.58559524 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.5 | 4064 | 25.1% |
| 1 | 2891 | 17.8% |
| 1.75 | 2283 | 14.1% |
| 2.25 | 1532 | 9.5% |
| 2 | 1424 | 8.8% |
| 1.5 | 1094 | 6.7% |
| 2.75 | 913 | 5.6% |
| 3 | 547 | 3.4% |
| 3.5 | 544 | 3.4% |
| 3.25 | 441 | 2.7% |
| Other values (19) | 476 | 2.9% |
| Value | Count | Frequency (%) |
| 0 | 7 | < 0.1% |
| 0.5 | 3 | < 0.1% |
| 0.75 | 51 | 0.3% |
| 1 | 2891 | |
| 1.25 | 8 | < 0.1% |
| 1.5 | 1094 | 6.7% |
| 1.75 | 2283 | |
| 2 | 1424 | 8.8% |
| 2.25 | 1532 | 9.5% |
| 2.5 | 4064 |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 7.75 | 1 | < 0.1% |
| 6.75 | 1 | < 0.1% |
| 6.5 | 1 | < 0.1% |
| 6.25 | 1 | < 0.1% |
| 6 | 3 | < 0.1% |
| 5.75 | 2 | < 0.1% |
| 5.5 | 6 | < 0.1% |
| 5.25 | 11 | |
| 5 | 17 |
sqft_living
Real number (ℝ)
High correlation
| Distinct | 881 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2073.2746 |
| Minimum | 290 |
| Maximum | 12050 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 290 |
|---|---|
| 5-th percentile | 940 |
| Q1 | 1430 |
| median | 1910 |
| Q3 | 2550 |
| 95-th percentile | 3740 |
| Maximum | 12050 |
| Range | 11760 |
| Interquartile range (IQR) | 1120 |
Descriptive statistics
| Standard deviation | 907.00949 |
|---|---|
| Coefficient of variation (CV) | 0.43747678 |
| Kurtosis | 4.2573129 |
| Mean | 2073.2746 |
| Median Absolute Deviation (MAD) | 540 |
| Skewness | 1.3787615 |
| Sum | 33605708 |
| Variance | 822666.22 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1800 | 103 | 0.6% |
| 1400 | 102 | 0.6% |
| 1250 | 101 | 0.6% |
| 1300 | 101 | 0.6% |
| 1010 | 98 | 0.6% |
| 1540 | 97 | 0.6% |
| 1720 | 97 | 0.6% |
| 1440 | 96 | 0.6% |
| 1820 | 96 | 0.6% |
| 1650 | 95 | 0.6% |
| Other values (871) | 15223 |
| Value | Count | Frequency (%) |
| 290 | 1 | |
| 370 | 1 | |
| 380 | 1 | |
| 390 | 2 | |
| 420 | 2 | |
| 430 | 1 | |
| 440 | 1 | |
| 460 | 1 | |
| 470 | 2 | |
| 480 | 2 |
| Value | Count | Frequency (%) |
| 12050 | 1 | |
| 10040 | 1 | |
| 9890 | 1 | |
| 9640 | 1 | |
| 8020 | 1 | |
| 8010 | 1 | |
| 7880 | 1 | |
| 7850 | 1 | |
| 7730 | 1 | |
| 7440 | 1 |
sqft_lot
Real number (ℝ)
High correlation
| Distinct | 8048 |
|---|---|
| Distinct (%) | 49.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14867.673 |
| Minimum | 520 |
| Maximum | 1164794 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 520 |
|---|---|
| 5-th percentile | 1755 |
| Q1 | 5004 |
| median | 7599 |
| Q3 | 10631 |
| 95-th percentile | 43010.2 |
| Maximum | 1164794 |
| Range | 1164274 |
| Interquartile range (IQR) | 5627 |
Descriptive statistics
| Standard deviation | 38825.702 |
|---|---|
| Coefficient of variation (CV) | 2.6114175 |
| Kurtosis | 209.17359 |
| Mean | 14867.673 |
| Median Absolute Deviation (MAD) | 2616 |
| Skewness | 11.407202 |
| Sum | 2.4099011 × 10⁸ |
| Variance | 1.5074351 × 10⁹ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5000 | 285 | 1.8% |
| 6000 | 197 | 1.2% |
| 4000 | 193 | 1.2% |
| 7200 | 165 | 1.0% |
| 4800 | 92 | 0.6% |
| 7500 | 90 | 0.6% |
| 9600 | 89 | 0.5% |
| 4500 | 82 | 0.5% |
| 3600 | 80 | 0.5% |
| 8400 | 78 | 0.5% |
| Other values (8038) | 14858 |
| Value | Count | Frequency (%) |
| 520 | 1 | |
| 609 | 1 | |
| 638 | 1 | |
| 649 | 2 | |
| 651 | 1 | |
| 675 | 1 | |
| 676 | 1 | |
| 681 | 1 | |
| 683 | 1 | |
| 690 | 1 |
| Value | Count | Frequency (%) |
| 1164794 | 1 | |
| 1074218 | 1 | |
| 1024068 | 1 | |
| 982998 | 1 | |
| 920423 | 1 | |
| 871200 | 1 | |
| 715690 | 1 | |
| 533610 | 1 | |
| 505166 | 1 | |
| 503989 | 1 |
floors
Real number (ℝ)
High correlation
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4988278 |
| Minimum | 1 |
| Maximum | 3.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1.5 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 3.5 |
| Range | 2.5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.54303211 |
|---|---|
| Coefficient of variation (CV) | 0.36230453 |
| Kurtosis | -0.48039768 |
| Mean | 1.4988278 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 0.61531878 |
| Sum | 24294.5 |
| Variance | 0.29488387 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 7970 | 49.2% |
| 2 | 6215 | 38.3% |
| 1.5 | 1414 | 8.7% |
| 3 | 489 | 3.0% |
| 2.5 | 117 | 0.7% |
| 3.5 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 7970 | |
| 1.5 | 1414 | 8.7% |
| 2 | 6215 | |
| 2.5 | 117 | 0.7% |
| 3 | 489 | 3.0% |
| 3.5 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 3.5 | 4 | < 0.1% |
| 3 | 489 | 3.0% |
| 2.5 | 117 | 0.7% |
| 2 | 6215 | |
| 1.5 | 1414 | 8.7% |
| 1 | 7970 |
waterfront
Categorical
High correlation Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 126.8 KiB |
| Value | Count |
|---|---|
| 0 | 16096 |
| 1 | 113 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 16096 | 99.3% |
| 1 | 113 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 16096 | |
| 1 | 113 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 16209 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 16096 | |
| 1 | 113 | 0.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 16209 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 16096 | |
| 1 | 113 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 16209 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 16096 | |
| 1 | 113 | 0.7% |
view
Categorical
High correlation Imbalance
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 126.8 KiB |
| Value | Count |
|---|---|
| 0 | 14604 |
| 2 | 743 |
| 3 | 375 |
| 1 | 254 |
| 4 | 233 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 14604 | 90.1% |
| 2 | 743 | 4.6% |
| 3 | 375 | 2.3% |
| 1 | 254 | 1.6% |
| 4 | 233 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 14604 | |
| 2 | 743 | 4.6% |
| 3 | 375 | 2.3% |
| 1 | 254 | 1.6% |
| 4 | 233 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 16209 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 14604 | |
| 2 | 743 | 4.6% |
| 3 | 375 | 2.3% |
| 1 | 254 | 1.6% |
| 4 | 233 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 16209 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 14604 | |
| 2 | 743 | 4.6% |
| 3 | 375 | 2.3% |
| 1 | 254 | 1.6% |
| 4 | 233 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 16209 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 14604 | |
| 2 | 743 | 4.6% |
| 3 | 375 | 2.3% |
| 1 | 254 | 1.6% |
| 4 | 233 | 1.4% |
condition
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 126.8 KiB |
| Value | Count |
|---|---|
| 3 | 10538 |
| 4 | 4238 |
| 5 | 1277 |
| 2 | 131 |
| 1 | 25 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 4 |
| 3rd row | 3 |
| 4th row | 3 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 10538 | 65.0% |
| 4 | 4238 | 26.1% |
| 5 | 1277 | 7.9% |
| 2 | 131 | 0.8% |
| 1 | 25 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 10538 | |
| 4 | 4238 | |
| 5 | 1277 | 7.9% |
| 2 | 131 | 0.8% |
| 1 | 25 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 16209 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 10538 | |
| 4 | 4238 | |
| 5 | 1277 | 7.9% |
| 2 | 131 | 0.8% |
| 1 | 25 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 16209 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 10538 | |
| 4 | 4238 | |
| 5 | 1277 | 7.9% |
| 2 | 131 | 0.8% |
| 1 | 25 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 16209 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 10538 | |
| 4 | 4238 | |
| 5 | 1277 | 7.9% |
| 2 | 131 | 0.8% |
| 1 | 25 | 0.2% |
grade
Real number (ℝ)
High correlation
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.6529706 |
| Minimum | 1 |
| Maximum | 13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 7 |
| median | 7 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 13 |
| Range | 12 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1710497 |
|---|---|
| Coefficient of variation (CV) | 0.15301897 |
| Kurtosis | 1.2123456 |
| Mean | 7.6529706 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.75343493 |
| Sum | 124047 |
| Variance | 1.3713573 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 6761 | 41.7% |
| 8 | 4563 | 28.2% |
| 9 | 1943 | 12.0% |
| 6 | 1511 | 9.3% |
| 10 | 861 | 5.3% |
| 11 | 286 | 1.8% |
| 5 | 183 | 1.1% |
| 12 | 63 | 0.4% |
| 4 | 24 | 0.1% |
| 13 | 10 | 0.1% |
| Other values (2) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 3 | 3 | < 0.1% |
| 4 | 24 | 0.1% |
| 5 | 183 | 1.1% |
| 6 | 1511 | 9.3% |
| 7 | 6761 | |
| 8 | 4563 | |
| 9 | 1943 | 12.0% |
| 10 | 861 | 5.3% |
| 11 | 286 | 1.8% |
| Value | Count | Frequency (%) |
| 13 | 10 | 0.1% |
| 12 | 63 | 0.4% |
| 11 | 286 | 1.8% |
| 10 | 861 | 5.3% |
| 9 | 1943 | 12.0% |
| 8 | 4563 | |
| 7 | 6761 | |
| 6 | 1511 | 9.3% |
| 5 | 183 | 1.1% |
| 4 | 24 | 0.1% |
sqft_above
Real number (ℝ)
High correlation
| Distinct | 803 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1784.7544 |
| Minimum | 290 |
| Maximum | 8860 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 290 |
|---|---|
| 5-th percentile | 850 |
| Q1 | 1200 |
| median | 1560 |
| Q3 | 2200 |
| 95-th percentile | 3390 |
| Maximum | 8860 |
| Range | 8570 |
| Interquartile range (IQR) | 1000 |
Descriptive statistics
| Standard deviation | 821.82084 |
|---|---|
| Coefficient of variation (CV) | 0.46046719 |
| Kurtosis | 3.2764936 |
| Mean | 1784.7544 |
| Median Absolute Deviation (MAD) | 450 |
| Skewness | 1.4303529 |
| Sum | 28929084 |
| Variance | 675389.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1300 | 158 | 1.0% |
| 1010 | 158 | 1.0% |
| 1200 | 151 | 0.9% |
| 1220 | 148 | 0.9% |
| 1140 | 140 | 0.9% |
| 1340 | 140 | 0.9% |
| 1250 | 140 | 0.9% |
| 1400 | 139 | 0.9% |
| 1180 | 133 | 0.8% |
| 1320 | 131 | 0.8% |
| Other values (793) | 14771 |
| Value | Count | Frequency (%) |
| 290 | 1 | |
| 370 | 1 | |
| 380 | 1 | |
| 390 | 2 | |
| 420 | 2 | |
| 430 | 1 | |
| 440 | 1 | |
| 460 | 1 | |
| 470 | 2 | |
| 480 | 2 |
| Value | Count | Frequency (%) |
| 8860 | 1 | |
| 8570 | 1 | |
| 8020 | 1 | |
| 7880 | 1 | |
| 7850 | 1 | |
| 7680 | 1 | |
| 7420 | 1 | |
| 7320 | 1 | |
| 6660 | 1 | |
| 6430 | 1 |
sqft_basement
Real number (ℝ)
Zeros
| Distinct | 280 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 288.5202 |
| Minimum | 0 |
| Maximum | 4820 |
| Zeros | 9882 |
| Zeros (%) | 61.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 560 |
| 95-th percentile | 1170 |
| Maximum | 4820 |
| Range | 4820 |
| Interquartile range (IQR) | 560 |
Descriptive statistics
| Standard deviation | 438.59891 |
|---|---|
| Coefficient of variation (CV) | 1.5201671 |
| Kurtosis | 2.6994341 |
| Mean | 288.5202 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.571497 |
| Sum | 4676624 |
| Variance | 192369 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 9882 | 61.0% |
| 600 | 177 | 1.1% |
| 700 | 162 | 1.0% |
| 800 | 159 | 1.0% |
| 500 | 155 | 1.0% |
| 400 | 134 | 0.8% |
| 1000 | 120 | 0.7% |
| 300 | 110 | 0.7% |
| 900 | 107 | 0.7% |
| 480 | 84 | 0.5% |
| Other values (270) | 5119 |
| Value | Count | Frequency (%) |
| 0 | 9882 | |
| 10 | 2 | < 0.1% |
| 20 | 1 | < 0.1% |
| 40 | 2 | < 0.1% |
| 50 | 8 | < 0.1% |
| 60 | 9 | 0.1% |
| 65 | 1 | < 0.1% |
| 70 | 6 | < 0.1% |
| 80 | 14 | 0.1% |
| 90 | 16 | 0.1% |
| Value | Count | Frequency (%) |
| 4820 | 1 | |
| 3500 | 1 | |
| 3480 | 1 | |
| 3260 | 1 | |
| 2850 | 1 | |
| 2730 | 1 | |
| 2620 | 1 | |
| 2610 | 1 | |
| 2600 | 1 | |
| 2590 | 1 |
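The column sums reported for sqft_living, sqft_above, and sqft_basement add up exactly, which is consistent with living area being the sum of above-ground and basement area. The check below only compares aggregates from this report; matching totals suggest, but do not prove, that the relation holds on every row.

```python
# Sums and means copied from the three variables' descriptive statistics.
sum_living, sum_above, sum_basement = 33605708, 28929084, 4676624
mean_living, mean_above, mean_basement = 2073.2746, 1784.7544, 288.5202

assert sum_above + sum_basement == sum_living
assert abs(mean_above + mean_basement - mean_living) < 1e-3

# Share of total living area that is basement.
basement_share = sum_basement / sum_living
print(f"{basement_share:.1%}")  # 13.9%
```

If the relation does hold row-wise, one of the three columns is redundant for modeling, which also helps explain the high correlation alerts on sqft_above and sqft_living.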
yr_built
Real number (ℝ)
High correlation
| Distinct | 116 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1971.1528 |
| Minimum | 1900 |
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1915 |
| Q1 | 1952 |
| median | 1975 |
| Q3 | 1997 |
| 95-th percentile | 2011 |
| Maximum | 2015 |
| Range | 115 |
| Interquartile range (IQR) | 45 |
Descriptive statistics
| Standard deviation | 29.372698 |
|---|---|
| Coefficient of variation (CV) | 0.01490128 |
| Kurtosis | -0.65458282 |
| Mean | 1971.1528 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | -0.47268095 |
| Sum | 31950415 |
| Variance | 862.75542 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 415 | 2.6% |
| 2006 | 347 | 2.1% |
| 2005 | 346 | 2.1% |
| 2004 | 330 | 2.0% |
| 2007 | 323 | 2.0% |
| 2003 | 321 | 2.0% |
| 1977 | 311 | 1.9% |
| 1978 | 290 | 1.8% |
| 2008 | 289 | 1.8% |
| 1968 | 268 | 1.7% |
| Other values (106) | 12969 |
| Value | Count | Frequency (%) |
| 1900 | 65 | |
| 1901 | 17 | 0.1% |
| 1902 | 21 | 0.1% |
| 1903 | 33 | |
| 1904 | 32 | |
| 1905 | 55 | |
| 1906 | 72 | |
| 1907 | 49 | |
| 1908 | 66 | |
| 1909 | 74 |
| Value | Count | Frequency (%) |
| 2015 | 30 | 0.2% |
| 2014 | 415 | |
| 2013 | 144 | 0.9% |
| 2012 | 134 | 0.8% |
| 2011 | 99 | 0.6% |
| 2010 | 116 | 0.7% |
| 2009 | 168 | |
| 2008 | 289 | |
| 2007 | 323 | |
| 2006 | 347 |
yr_renovated
Real number (ℝ)
Zeros
| Distinct | 69 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 82.738108 |
| Minimum | 0 |
| Maximum | 2015 |
| Zeros | 15537 |
| Zeros (%) | 95.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2015 |
| Range | 2015 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 397.86115 |
|---|---|
| Coefficient of variation (CV) | 4.8086807 |
| Kurtosis | 19.175936 |
| Mean | 82.738108 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.6013062 |
| Sum | 1341102 |
| Variance | 158293.49 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 15537 | 95.9% |
| 2014 | 68 | 0.4% |
| 2005 | 31 | 0.2% |
| 2003 | 27 | 0.2% |
| 2000 | 27 | 0.2% |
| 2007 | 27 | 0.2% |
| 2013 | 24 | 0.1% |
| 2004 | 22 | 0.1% |
| 2009 | 18 | 0.1% |
| 1990 | 18 | 0.1% |
| Other values (59) | 410 | 2.5% |
| Value | Count | Frequency (%) |
| 0 | 15537 | |
| 1934 | 1 | < 0.1% |
| 1940 | 2 | < 0.1% |
| 1944 | 1 | < 0.1% |
| 1945 | 1 | < 0.1% |
| 1946 | 1 | < 0.1% |
| 1948 | 1 | < 0.1% |
| 1950 | 2 | < 0.1% |
| 1951 | 1 | < 0.1% |
| 1954 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2015 | 8 | < 0.1% |
| 2014 | 68 | |
| 2013 | 24 | 0.1% |
| 2012 | 7 | < 0.1% |
| 2011 | 8 | < 0.1% |
| 2010 | 13 | 0.1% |
| 2009 | 18 | 0.1% |
| 2008 | 16 | 0.1% |
| 2007 | 27 | 0.2% |
| 2006 | 15 | 0.1% |
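Because 95.9% of yr_renovated values are 0, the reported mean of about 82.7 is not a meaningful year. Assuming 0 is a placeholder for "never renovated" (an interpretation the report does not state), the average renovation year among renovated homes can be recovered from the sum and the zero count alone:

```python
# Figures copied from the yr_renovated tables above.
n_rows = 16209
zeros = 15537
total = 1341102  # sum of the column; zero rows contribute nothing

# Assumption: 0 marks homes that were never renovated.
renovated = n_rows - zeros
mean_all = total / n_rows
mean_renovated = total / renovated

print(renovated)                # 672
print(f"{mean_all:.1f}")        # 82.7
print(f"{mean_renovated:.1f}")  # 1995.7
```

Under the same assumption, a boolean "was renovated" flag plus the year for the 672 renovated homes may be a more usable encoding than the raw column.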
zipcode
Real number (ℝ)
High correlation
| Distinct | 70 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98077.975 |
| Minimum | 98001 |
| Maximum | 98199 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 98001 |
|---|---|
| 5-th percentile | 98004 |
| Q1 | 98033 |
| median | 98065 |
| Q3 | 98117 |
| 95-th percentile | 98177 |
| Maximum | 98199 |
| Range | 198 |
| Interquartile range (IQR) | 84 |
Descriptive statistics
| Standard deviation | 53.355282 |
|---|---|
| Coefficient of variation (CV) | 0.0005440088 |
| Kurtosis | -0.84744549 |
| Mean | 98077.975 |
| Median Absolute Deviation (MAD) | 42 |
| Skewness | 0.4029859 |
| Sum | 1.5897459 × 10⁹ |
| Variance | 2846.7861 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 98103 | 458 | 2.8% |
| 98038 | 449 | 2.8% |
| 98115 | 437 | 2.7% |
| 98117 | 434 | 2.7% |
| 98052 | 429 | 2.6% |
| 98042 | 427 | 2.6% |
| 98034 | 389 | 2.4% |
| 98006 | 380 | 2.3% |
| 98118 | 376 | 2.3% |
| 98133 | 371 | 2.3% |
| Other values (60) | 12059 |
| Value | Count | Frequency (%) |
| 98001 | 271 | |
| 98002 | 151 | 0.9% |
| 98003 | 203 | |
| 98004 | 235 | |
| 98005 | 125 | 0.8% |
| 98006 | 380 | |
| 98007 | 103 | 0.6% |
| 98008 | 207 | |
| 98010 | 75 | 0.5% |
| 98011 | 152 | 0.9% |
| Value | Count | Frequency (%) |
| 98199 | 233 | |
| 98198 | 208 | |
| 98188 | 103 | 0.6% |
| 98178 | 203 | |
| 98177 | 185 | |
| 98168 | 201 | |
| 98166 | 190 | |
| 98155 | 326 | |
| 98148 | 37 | 0.2% |
| 98146 | 221 |
lat
Real number (ℝ)
High correlation
| Distinct | 4775 |
|---|---|
| Distinct (%) | 29.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.560707 |
| Minimum | 47.1593 |
| Maximum | 47.7776 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 47.1593 |
|---|---|
| 5-th percentile | 47.31084 |
| Q1 | 47.4725 |
| median | 47.5724 |
| Q3 | 47.6782 |
| 95-th percentile | 47.74986 |
| Maximum | 47.7776 |
| Range | 0.6183 |
| Interquartile range (IQR) | 0.2057 |
Descriptive statistics
| Standard deviation | 0.13833962 |
|---|---|
| Coefficient of variation (CV) | 0.0029086956 |
| Kurtosis | -0.67433635 |
| Mean | 47.560707 |
| Median Absolute Deviation (MAD) | 0.1041 |
| Skewness | -0.48836173 |
| Sum | 770911.49 |
| Variance | 0.01913785 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 47.5402 | 14 | 0.1% |
| 47.6968 | 13 | 0.1% |
| 47.6624 | 13 | 0.1% |
| 47.6711 | 13 | 0.1% |
| 47.686 | 13 | 0.1% |
| 47.5659 | 13 | 0.1% |
| 47.6651 | 13 | 0.1% |
| 47.6754 | 13 | 0.1% |
| 47.6846 | 12 | 0.1% |
| 47.6853 | 12 | 0.1% |
| Other values (4765) | 16080 |
| Value | Count | Frequency (%) |
| 47.1593 | 1 | |
| 47.1622 | 1 | |
| 47.1647 | 1 | |
| 47.1776 | 2 | |
| 47.1803 | 1 | |
| 47.1879 | 1 | |
| 47.1896 | 2 | |
| 47.19 | 2 | |
| 47.1903 | 1 | |
| 47.1913 | 2 |
| Value | Count | Frequency (%) |
| 47.7776 | 3 | |
| 47.7775 | 2 | |
| 47.7774 | 1 | < 0.1% |
| 47.7772 | 2 | |
| 47.7771 | 2 | |
| 47.777 | 2 | |
| 47.7769 | 2 | |
| 47.7768 | 1 | < 0.1% |
| 47.7767 | 4 | |
| 47.7766 | 2 |
long
Real number (ℝ)
High correlation
| Distinct | 708 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -122.214 |
| Minimum | -122.519 |
| Maximum | -121.315 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 16209 |
| Negative (%) | 100.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | -122.519 |
|---|---|
| 5-th percentile | -122.387 |
| Q1 | -122.328 |
| median | -122.23 |
| Q3 | -122.125 |
| 95-th percentile | -121.979 |
| Maximum | -121.315 |
| Range | 1.204 |
| Interquartile range (IQR) | 0.203 |
Descriptive statistics
| Standard deviation | 0.14009338 |
|---|---|
| Coefficient of variation (CV) | -0.0011462957 |
| Kurtosis | 0.7645686 |
| Mean | -122.214 |
| Median Absolute Deviation (MAD) | 0.101 |
| Skewness | 0.83755585 |
| Sum | -1980966.8 |
| Variance | 0.019626156 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -122.29 | 93 | 0.6% |
| -122.362 | 84 | 0.5% |
| -122.284 | 79 | 0.5% |
| -122.3 | 78 | 0.5% |
| -122.288 | 77 | 0.5% |
| -122.299 | 76 | 0.5% |
| -122.372 | 76 | 0.5% |
| -122.351 | 73 | 0.5% |
| -122.291 | 73 | 0.5% |
| -122.292 | 72 | 0.4% |
| Other values (698) | 15428 |
| Value | Count | Frequency (%) |
| -122.519 | 1 | < 0.1% |
| -122.514 | 1 | < 0.1% |
| -122.512 | 1 | < 0.1% |
| -122.511 | 2 | |
| -122.509 | 2 | |
| -122.506 | 1 | < 0.1% |
| -122.505 | 3 | |
| -122.504 | 2 | |
| -122.503 | 2 | |
| -122.502 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| -121.315 | 1 | |
| -121.316 | 1 | |
| -121.352 | 2 | |
| -121.402 | 1 | |
| -121.403 | 1 | |
| -121.405 | 1 | |
| -121.417 | 1 | |
| -121.473 | 1 | |
| -121.48 | 1 | |
| -121.691 | 1 |
sqft_living15
Real number (ℝ)
High correlation
| Distinct | 692 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1983.1523 |
| Minimum | 399 |
| Maximum | 6210 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 399 |
|---|---|
| 5-th percentile | 1137.4 |
| Q1 | 1480 |
| median | 1840 |
| Q3 | 2360 |
| 95-th percentile | 3290 |
| Maximum | 6210 |
| Range | 5811 |
| Interquartile range (IQR) | 880 |
Descriptive statistics
| Standard deviation | 681.90516 |
|---|---|
| Coefficient of variation (CV) | 0.34384912 |
| Kurtosis | 1.5794263 |
| Mean | 1983.1523 |
| Median Absolute Deviation (MAD) | 410 |
| Skewness | 1.094927 |
| Sum | 32144915 |
| Variance | 464994.65 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1560 | 144 | 0.9% |
| 1440 | 144 | 0.9% |
| 1540 | 139 | 0.9% |
| 1610 | 131 | 0.8% |
| 1500 | 130 | 0.8% |
| 1800 | 125 | 0.8% |
| 1680 | 124 | 0.8% |
| 1510 | 124 | 0.8% |
| 1480 | 122 | 0.8% |
| 1470 | 122 | 0.8% |
| Other values (682) | 14904 | 91.9% |
| Value | Count | Frequency (%) |
| 399 | 1 | < 0.1% |
| 460 | 2 | < 0.1% |
| 620 | 1 | < 0.1% |
| 690 | 2 | < 0.1% |
| 700 | 2 | < 0.1% |
| 710 | 1 | < 0.1% |
| 720 | 2 | < 0.1% |
| 740 | 8 | < 0.1% |
| 750 | 3 | < 0.1% |
| 760 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 6210 | 1 | < 0.1% |
| 6110 | 1 | < 0.1% |
| 5790 | 4 | < 0.1% |
| 5610 | 1 | < 0.1% |
| 5500 | 1 | < 0.1% |
| 5380 | 1 | < 0.1% |
| 5340 | 1 | < 0.1% |
| 5220 | 1 | < 0.1% |
| 5200 | 1 | < 0.1% |
| 5110 | 1 | < 0.1% |
sqft_lot15
Real number (ℝ)
High correlation
| Distinct | 7279 |
|---|---|
| Distinct (%) | 44.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12735.573 |
| Minimum | 651 |
|---|---|
| Maximum | 871200 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 651 |
|---|---|
| 5-th percentile | 1960 |
| Q1 | 5098 |
| median | 7620 |
| Q3 | 10053 |
| 95-th percentile | 36939 |
| Maximum | 871200 |
| Range | 870549 |
| Interquartile range (IQR) | 4955 |
Descriptive statistics
| Standard deviation | 26933.162 |
|---|---|
| Coefficient of variation (CV) | 2.1147979 |
| Kurtosis | 123.53504 |
| Mean | 12735.573 |
| Median Absolute Deviation (MAD) | 2509 |
| Skewness | 8.7516042 |
| Sum | 2.064309 × 10⁸ |
| Variance | 7.2539522 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5000 | 328 | 2.0% |
| 4000 | 267 | 1.6% |
| 6000 | 213 | 1.3% |
| 7200 | 149 | 0.9% |
| 7500 | 107 | 0.7% |
| 4800 | 106 | 0.7% |
| 8400 | 90 | 0.6% |
| 3600 | 87 | 0.5% |
| 5100 | 86 | 0.5% |
| 4080 | 83 | 0.5% |
| Other values (7269) | 14693 | 90.6% |
| Value | Count | Frequency (%) |
| 651 | 1 | < 0.1% |
| 748 | 1 | < 0.1% |
| 750 | 3 | < 0.1% |
| 755 | 1 | < 0.1% |
| 757 | 1 | < 0.1% |
| 758 | 1 | < 0.1% |
| 788 | 1 | < 0.1% |
| 794 | 1 | < 0.1% |
| 809 | 1 | < 0.1% |
| 810 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 871200 | 1 | < 0.1% |
| 438213 | 1 | < 0.1% |
| 434728 | 1 | < 0.1% |
| 422967 | 1 | < 0.1% |
| 411962 | 1 | < 0.1% |
| 386812 | 1 | < 0.1% |
| 380279 | 1 | < 0.1% |
| 360000 | 1 | < 0.1% |
| 339332 | 1 | < 0.1% |
| 335289 | 1 | < 0.1% |
zip_median_price
Real number (ℝ)
High correlation
| Distinct | 67 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 485184.88 |
| Minimum | 235000 |
|---|---|
| Maximum | 1905000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 126.8 KiB |
Quantile statistics
| Minimum | 235000 |
|---|---|
| 5-th percentile | 259000 |
| Q1 | 332000 |
| median | 445000 |
| Q3 | 575000 |
| 95-th percentile | 779500 |
| Maximum | 1905000 |
| Range | 1670000 |
| Interquartile range (IQR) | 243000 |
Descriptive statistics
| Standard deviation | 195753.45 |
|---|---|
| Coefficient of variation (CV) | 0.40346156 |
| Kurtosis | 6.436611 |
| Mean | 485184.88 |
| Median Absolute Deviation (MAD) | 130000 |
| Skewness | 1.6480502 |
| Sum | 7.8643617 × 10⁹ |
| Variance | 3.8319412 × 10¹⁰ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 570000 | 694 | 4.3% |
| 575000 | 505 | 3.1% |
| 548500 | 458 | 2.8% |
| 339000 | 449 | 2.8% |
| 542500 | 434 | 2.7% |
| 620000 | 429 | 2.6% |
| 298000 | 427 | 2.6% |
| 445950 | 389 | 2.4% |
| 779500 | 380 | 2.3% |
| 361431 | 376 | 2.3% |
| Other values (57) | 11668 | 72.0% |
| Value | Count | Frequency (%) |
| 235000 | 352 | 2.2% |
| 249000 | 91 | 0.6% |
| 258000 | 271 | 1.7% |
| 259000 | 103 | 0.6% |
| 263000 | 203 | 1.3% |
| 265000 | 37 | 0.2% |
| 266125 | 208 | 1.3% |
| 270000 | 365 | 2.3% |
| 275000 | 172 | 1.1% |
| 277554 | 203 | 1.3% |
| Value | Count | Frequency (%) |
| 1905000 | 37 | 0.2% |
| 1110000 | 235 | 1.4% |
| 997000 | 204 | 1.3% |
| 940000 | 207 | 1.3% |
| 779500 | 380 | 2.3% |
| 762450 | 125 | 0.8% |
| 740000 | 142 | 0.9% |
| 739999.5 | 284 | 1.8% |
| 716000 | 81 | 0.5% |
| 690000 | 79 | 0.5% |
zip_tier
Categorical
High correlation
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 126.8 KiB |
| 3 | 3598 |
|---|---|
| 1 | 3248 |
| 2 | 3240 |
| 4 | 3139 |
| 5 | 2984 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 4 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 3598 | 22.2% |
| 1 | 3248 | 20.0% |
| 2 | 3240 | 20.0% |
| 4 | 3139 | 19.4% |
| 5 | 2984 | 18.4% |
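The Frequency (%) column is each count divided by the number of observations, shown to one decimal place. A minimal sketch that derives it from the zip_tier counts above (the dictionary and variable names are illustrative, not taken from the profiling tool):

```python
# Counts copied from the zip_tier Common Values table.
counts = {"3": 3598, "1": 3248, "2": 3240, "4": 3139, "5": 2984}

# The counts should add up to the number of observations in the dataset.
total = sum(counts.values())

# Share of each tier as a percentage, rounded to one decimal place.
frequency_pct = {tier: round(100 * n / total, 1) for tier, n in counts.items()}

for tier, pct in frequency_pct.items():
    print(f"tier {tier}: {counts[tier]} rows, {pct}%")
```

The five tiers come out at roughly a fifth of the rows each, which suggests, without confirming, that the tiers were built as approximately equal-sized bins.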
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 3598 | 22.2% |
| 1 | 3248 | 20.0% |
| 2 | 3240 | 20.0% |
| 4 | 3139 | 19.4% |
| 5 | 2984 | 18.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 16209 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 16209 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 16209 | 100.0% |
Interactions
Correlations
| | bathrooms | bedrooms | condition | floors | grade | id | lat | long | price | sqft_above | sqft_basement | sqft_living | sqft_living15 | sqft_lot | sqft_lot15 | view | waterfront | yr_built | yr_renovated | zip_median_price | zip_tier | zipcode |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| bathrooms | 1.000 | 0.524 | 0.130 | 0.551 | 0.657 | 0.017 | 0.008 | 0.257 | 0.496 | 0.693 | 0.189 | 0.745 | 0.566 | 0.064 | 0.058 | 0.109 | 0.110 | 0.563 | 0.045 | 0.211 | 0.121 | -0.203 |
| bedrooms | 0.524 | 1.000 | 0.021 | 0.228 | 0.377 | 0.011 | -0.029 | 0.187 | 0.340 | 0.539 | 0.228 | 0.645 | 0.437 | 0.209 | 0.197 | 0.034 | 0.000 | 0.176 | 0.019 | 0.101 | 0.072 | -0.171 |
| condition | 0.130 | 0.021 | 1.000 | 0.182 | 0.163 | 0.033 | 0.060 | 0.084 | 0.019 | 0.105 | 0.095 | 0.061 | 0.063 | 0.031 | 0.019 | 0.027 | 0.022 | 0.250 | 0.065 | 0.046 | 0.040 | 0.072 |
| floors | 0.551 | 0.228 | 0.182 | 1.000 | 0.503 | 0.021 | 0.026 | 0.148 | 0.318 | 0.597 | -0.273 | 0.398 | 0.299 | -0.240 | -0.234 | 0.017 | 0.017 | 0.560 | 0.013 | 0.171 | 0.106 | -0.063 |
| grade | 0.657 | 0.377 | 0.163 | 0.503 | 1.000 | 0.024 | 0.109 | 0.223 | 0.655 | 0.706 | 0.094 | 0.712 | 0.657 | 0.149 | 0.152 | 0.135 | 0.089 | 0.501 | 0.019 | 0.376 | 0.205 | -0.181 |
| id | 0.017 | 0.011 | 0.033 | 0.021 | 0.024 | 1.000 | -0.009 | 0.009 | 0.001 | 0.008 | -0.002 | 0.004 | 0.001 | -0.113 | -0.112 | 0.032 | 0.000 | 0.035 | -0.023 | 0.009 | 0.094 | -0.006 |
| lat | 0.008 | -0.029 | 0.060 | 0.026 | 0.109 | -0.009 | 1.000 | -0.149 | 0.458 | -0.029 | 0.123 | 0.031 | 0.031 | -0.124 | -0.118 | 0.071 | 0.028 | -0.124 | 0.028 | 0.606 | 0.514 | 0.248 |
| long | 0.257 | 0.187 | 0.084 | 0.148 | 0.223 | 0.009 | -0.149 | 1.000 | 0.063 | 0.389 | -0.207 | 0.284 | 0.386 | 0.374 | 0.378 | 0.093 | 0.105 | 0.413 | -0.078 | 0.098 | 0.251 | -0.579 |
| price | 0.496 | 0.340 | 0.019 | 0.318 | 0.655 | 0.001 | 0.458 | 0.063 | 1.000 | 0.536 | 0.255 | 0.640 | 0.570 | 0.073 | 0.061 | 0.200 | 0.309 | 0.099 | 0.106 | 0.745 | 0.222 | -0.011 |
| sqft_above | 0.693 | 0.539 | 0.105 | 0.597 | 0.706 | 0.008 | -0.029 | 0.389 | 0.536 | 1.000 | -0.166 | 0.843 | 0.693 | 0.269 | 0.252 | 0.081 | 0.065 | 0.470 | 0.033 | 0.203 | 0.129 | -0.284 |
| sqft_basement | 0.189 | 0.228 | 0.095 | -0.273 | 0.094 | -0.002 | 0.123 | -0.207 | 0.255 | -0.166 | 1.000 | 0.329 | 0.129 | 0.037 | 0.031 | 0.154 | 0.146 | -0.183 | 0.063 | 0.150 | 0.079 | 0.116 |
| sqft_living | 0.745 | 0.645 | 0.061 | 0.398 | 0.712 | 0.004 | 0.031 | 0.284 | 0.640 | 0.843 | 0.329 | 1.000 | 0.743 | 0.302 | 0.283 | 0.144 | 0.143 | 0.347 | 0.053 | 0.258 | 0.149 | -0.210 |
| sqft_living15 | 0.566 | 0.437 | 0.063 | 0.299 | 0.657 | 0.001 | 0.031 | 0.386 | 0.570 | 0.693 | 0.129 | 0.743 | 1.000 | 0.363 | 0.370 | 0.148 | 0.074 | 0.332 | -0.002 | 0.305 | 0.178 | -0.292 |
| sqft_lot | 0.064 | 0.209 | 0.031 | -0.240 | 0.149 | -0.113 | -0.124 | 0.374 | 0.073 | 0.269 | 0.037 | 0.302 | 0.363 | 1.000 | 0.923 | 0.034 | 0.035 | -0.047 | 0.007 | -0.061 | 0.032 | -0.321 |
| sqft_lot15 | 0.058 | 0.197 | 0.019 | -0.234 | 0.152 | -0.112 | -0.118 | 0.378 | 0.061 | 0.252 | 0.031 | 0.283 | 0.370 | 0.923 | 1.000 | 0.029 | 0.000 | -0.026 | 0.008 | -0.051 | 0.039 | -0.326 |
| view | 0.109 | 0.034 | 0.027 | 0.017 | 0.135 | 0.032 | 0.071 | 0.093 | 0.200 | 0.081 | 0.154 | 0.144 | 0.148 | 0.034 | 0.029 | 1.000 | 0.561 | 0.043 | 0.105 | 0.060 | 0.055 | 0.076 |
| waterfront | 0.110 | 0.000 | 0.022 | 0.017 | 0.089 | 0.000 | 0.028 | 0.105 | 0.309 | 0.065 | 0.146 | 0.143 | 0.074 | 0.035 | 0.000 | 0.561 | 1.000 | 0.041 | 0.088 | 0.025 | 0.028 | 0.078 |
| yr_built | 0.563 | 0.176 | 0.250 | 0.560 | 0.501 | 0.035 | -0.124 | 0.413 | 0.099 | 0.470 | -0.183 | 0.347 | 0.332 | -0.047 | -0.026 | 0.043 | 0.041 | 1.000 | -0.217 | -0.001 | 0.120 | -0.312 |
| yr_renovated | 0.045 | 0.019 | 0.065 | 0.013 | 0.019 | -0.023 | 0.028 | -0.078 | 0.106 | 0.033 | 0.063 | 0.053 | -0.002 | 0.007 | 0.008 | 0.105 | 0.088 | -0.217 | 1.000 | 0.062 | 0.057 | 0.060 |
| zip_median_price | 0.211 | 0.101 | 0.046 | 0.171 | 0.376 | 0.009 | 0.606 | 0.098 | 0.745 | 0.203 | 0.150 | 0.258 | 0.305 | -0.061 | -0.051 | 0.060 | 0.025 | -0.001 | 0.062 | 1.000 | 0.686 | -0.023 |
| zip_tier | 0.121 | 0.072 | 0.040 | 0.106 | 0.205 | 0.094 | 0.514 | 0.251 | 0.222 | 0.129 | 0.079 | 0.149 | 0.178 | 0.032 | 0.039 | 0.055 | 0.028 | 0.120 | 0.057 | 0.686 | 1.000 | 0.384 |
| zipcode | -0.203 | -0.171 | 0.072 | -0.063 | -0.181 | -0.006 | 0.248 | -0.579 | -0.011 | -0.284 | 0.116 | -0.210 | -0.292 | -0.321 | -0.326 | 0.076 | 0.078 | -0.312 | 0.060 | -0.023 | 0.384 | 1.000 |
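The "High correlation" alerts in the overview can be reproduced from this matrix by keeping the entries whose absolute value reaches a cutoff. The sketch below assumes a cutoff of 0.5, which is consistent with the alerts listed for this dataset but is not stated in the report itself; the function name and the partial `price_row` dictionary are illustrative.

```python
# Part of the price row, copied from the correlation matrix above.
price_row = {
    "bathrooms": 0.496,
    "bedrooms": 0.340,
    "grade": 0.655,
    "lat": 0.458,
    "sqft_above": 0.536,
    "sqft_living": 0.640,
    "sqft_living15": 0.570,
    "zip_median_price": 0.745,
    "zip_tier": 0.222,
}


def highly_correlated(row, threshold=0.5):
    """Return field names with |correlation| >= threshold, strongest first.

    The 0.5 default is an assumed cutoff, not a value read from the report.
    """
    hits = [name for name, r in row.items() if abs(r) >= threshold]
    return sorted(hits, key=lambda name: abs(row[name]), reverse=True)


print(highly_correlated(price_row))
```

With these values the function returns five fields, matching the overview alert that price is highly correlated with grade and 4 other fields. bathrooms (0.496) and lat (0.458) fall just under the assumed cutoff.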
Missing values
Sample
| | id | date | price | bedrooms | bathrooms | sqft_living | sqft_lot | floors | waterfront | view | condition | grade | sqft_above | sqft_basement | yr_built | yr_renovated | zipcode | lat | long | sqft_living15 | sqft_lot15 | zip_median_price | zip_tier |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 9117000170 | 20150505T000000 | 268643 | 4 | 2.25 | 1810 | 9240 | 2.0 | 0 | 0 | 3 | 7 | 1810 | 0 | 1961 | 0 | 98055 | 47.4362 | -122.187 | 1660 | 9240 | 298475.0 | 1 |
| 1 | 6700390210 | 20140708T000000 | 245000 | 3 | 2.50 | 1600 | 2788 | 2.0 | 0 | 0 | 4 | 7 | 1600 | 0 | 1992 | 0 | 98031 | 47.4034 | -122.187 | 1720 | 3605 | 290000.0 | 1 |
| 2 | 7212660540 | 20150115T000000 | 200000 | 4 | 2.50 | 1720 | 8638 | 2.0 | 0 | 0 | 3 | 8 | 1720 | 0 | 1994 | 0 | 98003 | 47.2704 | -122.313 | 1870 | 7455 | 263000.0 | 1 |
| 3 | 8562780200 | 20150427T000000 | 352499 | 2 | 2.25 | 1240 | 705 | 2.0 | 0 | 0 | 3 | 7 | 1150 | 90 | 2009 | 0 | 98027 | 47.5321 | -122.073 | 1240 | 750 | 575000.0 | 4 |
| 4 | 7760400350 | 20141205T000000 | 232000 | 3 | 2.00 | 1280 | 13356 | 1.0 | 0 | 0 | 3 | 7 | 1280 | 0 | 1994 | 0 | 98042 | 47.3715 | -122.074 | 1590 | 8071 | 298000.0 | 1 |
| 5 | 464001025 | 20140918T000000 | 722500 | 4 | 3.50 | 2600 | 5100 | 2.0 | 0 | 0 | 3 | 8 | 1820 | 780 | 2003 | 0 | 98117 | 47.6948 | -122.395 | 2000 | 6720 | 542500.0 | 3 |
| 6 | 3432500486 | 20140623T000000 | 299995 | 2 | 1.00 | 1060 | 7200 | 1.0 | 0 | 0 | 4 | 6 | 1060 | 0 | 1951 | 0 | 98155 | 47.7463 | -122.315 | 1850 | 8291 | 381500.0 | 2 |
| 7 | 1126059095 | 20140526T000000 | 880000 | 3 | 2.00 | 2130 | 35169 | 1.0 | 0 | 0 | 4 | 8 | 2130 | 0 | 1989 | 0 | 98072 | 47.7489 | -122.123 | 2860 | 43560 | 530000.0 | 3 |
| 8 | 3876500290 | 20150305T000000 | 175000 | 3 | 1.00 | 1070 | 6164 | 1.0 | 0 | 0 | 3 | 7 | 1070 | 0 | 1967 | 0 | 98001 | 47.3377 | -122.291 | 1320 | 7920 | 258000.0 | 1 |
| 9 | 1865400075 | 20140522T000000 | 320000 | 3 | 2.25 | 998 | 844 | 2.0 | 0 | 0 | 3 | 7 | 798 | 200 | 2007 | 0 | 98117 | 47.6983 | -122.367 | 998 | 1110 | 542500.0 | 3 |
| | id | date | price | bedrooms | bathrooms | sqft_living | sqft_lot | floors | waterfront | view | condition | grade | sqft_above | sqft_basement | yr_built | yr_renovated | zipcode | lat | long | sqft_living15 | sqft_lot15 | zip_median_price | zip_tier |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 16199 | 9543000205 | 20150413T000000 | 139950 | 0 | 0.00 | 844 | 4269 | 1.0 | 0 | 0 | 4 | 7 | 844 | 0 | 1913 | 0 | 98001 | 47.2781 | -122.250 | 1380 | 9600 | 258000.0 | 1 |
| 16200 | 7781600100 | 20140905T000000 | 1338750 | 3 | 2.75 | 2730 | 38869 | 1.5 | 1 | 4 | 3 | 9 | 1940 | 790 | 1963 | 2001 | 98146 | 47.4857 | -122.361 | 2630 | 28188 | 287000.0 | 1 |
| 16201 | 7215721350 | 20150422T000000 | 465000 | 3 | 2.50 | 1650 | 4636 | 2.0 | 0 | 0 | 3 | 8 | 1650 | 0 | 1999 | 0 | 98075 | 47.5997 | -122.016 | 1650 | 4504 | 739999.5 | 5 |
| 16202 | 6600220380 | 20140531T000000 | 538888 | 5 | 2.75 | 2080 | 13189 | 2.0 | 0 | 0 | 3 | 8 | 2080 | 0 | 1987 | 0 | 98074 | 47.6288 | -122.031 | 2030 | 11847 | 635000.0 | 5 |
| 16203 | 5078400160 | 20140605T000000 | 1800000 | 5 | 4.50 | 4400 | 15580 | 2.0 | 0 | 0 | 3 | 11 | 3390 | 1010 | 2003 | 0 | 98004 | 47.6232 | -122.207 | 2150 | 14249 | 1110000.0 | 5 |
| 16204 | 5272200045 | 20141113T000000 | 378000 | 3 | 1.50 | 1000 | 6914 | 1.0 | 0 | 0 | 3 | 7 | 1000 | 0 | 1947 | 0 | 98125 | 47.7144 | -122.319 | 1000 | 6947 | 425000.0 | 3 |
| 16205 | 9578500790 | 20141111T000000 | 399950 | 3 | 2.50 | 3087 | 5002 | 2.0 | 0 | 0 | 3 | 8 | 3087 | 0 | 2014 | 0 | 98023 | 47.2974 | -122.349 | 2927 | 5183 | 270000.0 | 1 |
| 16206 | 7202350480 | 20140930T000000 | 575000 | 3 | 2.50 | 2120 | 4780 | 2.0 | 0 | 0 | 3 | 7 | 2120 | 0 | 2004 | 0 | 98053 | 47.6810 | -122.032 | 1690 | 2650 | 610000.0 | 4 |
| 16207 | 1723049033 | 20140620T000000 | 245000 | 1 | 0.75 | 380 | 15000 | 1.0 | 0 | 0 | 3 | 5 | 380 | 0 | 1963 | 0 | 98168 | 47.4810 | -122.323 | 1170 | 15000 | 235000.0 | 1 |
| 16208 | 6147650280 | 20150325T000000 | 315000 | 4 | 2.50 | 3130 | 5999 | 2.0 | 0 | 0 | 3 | 7 | 3130 | 0 | 2006 | 0 | 98042 | 47.3837 | -122.099 | 3020 | 5997 | 298000.0 | 1 |
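The date column in the sample rows is a compact timestamp string such as 20150505T000000. A minimal sketch of turning it into a Python datetime, assuming every value follows the year-month-day, `T`, hour-minute-second layout seen in the sample (the helper name is illustrative):

```python
from datetime import datetime

# Layout inferred from the sample rows: YYYYMMDD, a literal "T", then HHMMSS.
DATE_FORMAT = "%Y%m%dT%H%M%S"


def parse_sale_date(raw: str) -> datetime:
    """Parse a compact timestamp like '20150505T000000'."""
    return datetime.strptime(raw, DATE_FORMAT)


first = parse_sale_date("20150505T000000")
print(first.date().isoformat())
```

All sample rows carry a 000000 time component, so only the date part appears to hold information.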